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Abstract 

We consider a model of asexually reproducing individuals. The birth and death rates of 
the individuals are affected by a fitness parameter. The rate of mutations that cause the 
fitnesses to change is proportional to the population size, N. The mutations may be either 
beneficial or deleterious. In a paper by Yu, Ethcridge and Cuthbertson (2009) it was shown 
that the average rate at which the mean fitness increases in this model is bounded below by 
log 1-5 N for any S > 0. We achieve an upper bound on the average rate at which the mean 
fitness increases of 0(\ogN/ log log N). 

Keywords: Evolutionary process, Moran Model, Selection, Adaptation rate 

1 Introduction 

In a finite, asexually reproducing population with mutations it is well-known that competition 
among multiple individuals that get beneficial mutations can slow the rate of adaptation. This 
phenomenon is known as the Hill- Robertson effect, named for the authors of [7]. One may wish 
to consider the effect on the rate of adaptation of a population when there are many beneficial 
mutations present simultaneously. It is easily observed that when such a population is finite and 
all mutations are either neutral or deleterious then the fitness of the population will decrease over 
time. This scenario is known as Muller's ratchet. The first rigorous results regarding Muller's 
ratchet were due to Haigh [6j. In an asexually reproducing population beneficial mutations are 
necessary to overcome Muller's ratchet. Yu, Etheridge and Cuthbertson |11| proposed a model 
that gives insight into both the Hill-Robertson effect and Muller's ratchet in large populations 
with fast mutation rates. 

The model introduced in pJJ| is a Moran model with mutations and selection. In our model 
there is a population of N cells and positive constants [a, q and 7 which do not depend on ./V and 
determine the dynamics of the system. The parameters must satisfy /i > 0, < q < 1 and 7 > 0. 
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Let X\ be the fitness of cell i at time t. Then X = (X 1 , X 2 , ... , X N ) is a stochastic process with 
state space Z . The system has the following dynamics: 

1. Mutation: Each individual acquires mutations at rate [i. When cell i gets a mutation it is 
beneficial with probability q and X 1 increases by 1. With probability 1 — q the mutation is 
deleterious and X % decreases by 1. 

2. Selection: For each pair of individuals (i, j), at rate jf{X l — X 3 ) + we set X 3 equal to X 1 . 

3. Resampling: For each pair of individuals (i, j), at rate 1/N we set X 3 equal to X % . 

Our model only differs from the one in [11] by allowing q = \. The upper bound we establish for 
the rate of adaptation still holds in the absence of deleterious mutations. 

Under the selection mechanism the event that X 3 is set to equal X 1 represents the more fit 
cell i giving birth and the less fit cell j dying. Likewise, the resampling event that causes X 3 to 
equal X % represents cell i giving birth and cell j dying. 

A heuristic argument in [11] shows that as N tends to infinity the mean rate of increase of 
the average fitness of the cells in X is 0(k) where k is the number satisfying 



We note that this is equivalent to being O (log N/ log log N). The heuristic argument is difficult 
to extend to a rigorous argument. Let 



be the continuous-time process which represents the average fitness of the cells in X and start X 
as centered about so that Xq = 0. The rigorous results established in [TT] are as follows: 

• The centered process X , in which cell i has fitness X = X % — X, is ergodic and has a 
stationary distribution ir. 



where E n means that the initial configuration of X is chosen according to the stationary 
distribution n. 

• For any 5 > there exists iVo large enough so that if N > Nq then i^fXi] > log 1 ^ N. 

Rigorous estimates for E w [c2] are difficult to find so other methods are needed to compute E[Xt\. 
The third result of |11] shows that if there is a positive ratio of beneficial mutations then a large 
enough population will increase in fitness over time. A paper by Etheridge and Yu [5] provides 
further results pertaining to this model. 



P = N. 




If 




is the variance of the centered process under the stationary distribution, then 



E*[X t ] = (n(2q-l)+ 7 E*lc 2 ])t 
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Other similar models can be found in the biological literature. In these models the density 
of the particles is assumed to act as a traveling wave in time. The bulk of the wave behaves 
approximately deterministically and the random noise comes from the most fit classes of cells. One 
tries to determine how quickly the fittest classes advance and pull the wave forward. This traveling 
wave approach is used in [10] and [UJ to approximate the rate of evolution as 0(log Nf log log N). 
For other work in this direction see Rouzine, Brunet and Wilke [9], Brunet, Rouzine and Wilke 
PQ, Desai and Fisher [2] and Park, Simon and Krug [8]. Using non-rigorous arguments, these 
authors get estimates of O(logiV), O (log Nf log log N), and O (log Nf (log log N) 2 ), where the 
differences depend on the details of the models that they analyze. For more motivation and 
details concerning this model, please see the introduction in [11] . 

Motivated by applications to cancer development, Durrett and Mayberry have established 
rigorous results for a similar model in [3J. They consider two models in which all mutations are 
beneficial and the mutation rate tends to as the population size tends to infinity. In one of their 
models the population size is fixed and in the other it is exponentially increasing. For the model 
with the fixed population size they show that the rate at which the average fitness is expected 
to increase is O(logiV). By considering the expected number of individuals that have fitness k 
at time t, they establish rigorously that the density of the particles in their model will act as a 
traveling wave in time. 

What we wish to establish in this paper is an upper bound for the mean rate of adaptation. 
We do this by bounding the rate at which the maximum fitness increases. Throughout the 
paper we let Xf = max{JQ : 1 < i < N} be the maximum fitness of any cell at time t and 
Xf = mm{Xl : 1 < % < N} be the minimum fitness of any cell at time t. Define the width of 
the process to be Wt = Xf — XT and define Dt = Xf — X§ be the distance the front of the 
process has traveled by time t. Our main result is given in the following theorem. 

Theorem 1. Let Xq 

large enough 



for all t > 1. 

A difference between the result in [Tl] and our result is that in [Tl] the initial state of the 
process is randomly chosen according to the stationary distribution ir while we make the assump- 
tion that all of the cells initially have fitness 0. With the initial condition Xq = for 1 < i < N, 
we let {J 7 t}t>o be the natural filtration associated with X. 

Through the use of branching processes we establish a bound on Dt that depends on the 
width. If the width is large then our bound is also large. To establish our result we will need 
to show the width is not large too long or too often. This motivates us to make the following 
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= for 1 < i < N. There exist positive constants a and b such that for N 

1 logiV 
-E[D t ]<(a + b 7 ) f 
t log log N 



definitions: 



ti = 

s n = inf{t > t n : W t > 2 log N} for n > 1 
t n = inf{t > Sn-! : W< < log N} for n > 2 
Yi = sup Z) t — D Si for i > 1 

Si<t<t i + 1 

Nt = max{i : Sj < t} for t > 

We will make use of the strong Markov property of X at the times s n and t n for n > 1. For this 
reason, many of the statements we prove below will include conditions for which Wq > even 
though according to the conditions of Theorem Q] we have Wq = 0. It is easy to see that the 
sequences {sn}^^ and {tn}^^ will be infinite with probability 1. We establish a small upper 
bound for Dt on the time intervals [t n ,s n ) and show the complimentary time intervals [s n ,t n+ i) 
are infrequent enough that the fast rate of increase of Dt during these times will not significantly 
change the mean rate. 

2 Coupling with Branching Processes 

We begin this section by giving an equivalent description of the process X in which the Poisson 
processes that determine the dynamics of X are explicit. Fix the population size N. Then X is 
a stochastic process with state space Z™. The Poisson processes that determine the dynamics of 
X £1X6 clS follows: 

• There are N Poisson processes V 1 ^, 1 < i < N, on [0, oo) of rate q[i. If V 1 ^ gets a mark at 
t then the i th coordinate of X increases by 1 at time t. 

• There are N Poisson processes V^, 1 < i < N, on [0, oo) of rate (1 — q)ii. If V 1 ^ gets a 
mark at t then the i th coordinate of X decreases by 1 at time t. 

• For each ordered pair of coordinates with i ^ j there is a Poisson process on [0, oo), 

, of rate 1/N. If V 1 ^ gets a mark at t then the j th coordinate changes its value to agree 
with the i th coordinate at time t. 

• For each ordered pair of coordinates with i ^ j there is a Poisson processes on 
[0, oo) x [0, oo), V 1 '^, which has intensity -^A where A is Lebesgue measure on M 2 . If 
gets a mark in {t} x [0, X\_ — Xl_] then the j th coordinate changes its value to agree with 
the i th coordinate at time t. 

The following proofs will involve coupling X with various branching processes. To avoid 
confusion, we will refer to individuals in X as cells and we will refer to individuals in the branching 
processes as particles. While the cells in X each have an integer value that we refer to as the 
fitness of the cell, the particles in a branching process will each be given an integer value that 
we refer to as the type of the particle. Let Z c = {Zf}t>o be a multi-type Yule process in which 
there are initially N particles of type 0. Particles increase from type i to type i + 1 at rate \x and 
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branch at rate C. When a particle of type i branches the new particle is also type i. Let Mf be 
the maximum type of any particle at time t. 

Before proving a lemma about the branching process Z , we first prove a lemma which follows 
from Taylor's Remainder Theorem that we will make use of several times. 

Lemma 2. Let x > 0. The tail of the exponential series satisfies 

E"° x 1 x k e x 
1\ - ~kT' 

i=k 

Proof. By Taylor's Remainder Theorem we know that there exists a £ £ [0, x] such that 

k—l a up 

x sr-^ x x e ? 

6 = ^l\ + ~kV 

i=i 



Using the series expansion of e x we have 



E 



i\ k\ ~ k\ 

i=k 



Lemma 3. For any population size N, time t > 0, and natural number I, 



□ 



P(Mf >l)< . 

Proof. Consider a Yule process Z which is the same as Z c except there is only one particle at 
time 0. It is well known that the number of particles in Zt has mean e ct . Let M[ be the maximum 
type of any particle at time t. When there are k particles in the population, we let B%, . . . , 
denote the types of the particles, where the numbering is independent of the mutations. For any 
I > 0, 

oo 

P(M' t > I) = Y J p ( M t > l\ z t = k)P{Z t = k) 
k=i 

oo 

= yj F({B t > !} U ■ ■ ■ U {B k > l\\Z t = k)P(Z, = k) 



E 

By Lemma [2] it follows that 



fc=i 

oo 

< Yj ^P{Bi > l)P{Z t = k) 

k=l 

= E[Z t ]P(B 1 > I) 

oo 

(//')' 

e 



ct \^ (t^y 



i=i 



p{m[ >i)< MfL. 
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Now consider Z c . At time label the particles 1,2, ... ,N and let M- t be the maximum type 
of any particle among the progeny of particle i at time t. Then 

P{M? >l) = P{{M' U > 1} U • • • U {M' N t > I}) 
< NP(M' l t > I) 
N(tn) l e ct 

- T\ ' 

□ 

The next proposition will give a lower bound on the fitness of any cell up to time t given that 
we know the least fitness at time is Xq . We do this by establishing an upper bound on the 
amount that any cell will decrease in fitness. Let St = sup < s <t(^o" — X~). 

Proposition 4. For any population size N , time t > 0, and natural number I, 

iV(i/i) z e* 



Proof. By Lemma [3] we have 



P(S t >l)< 
P(Ml >l)< 



11 ' 

N(tp) l e l 



for any population size N, time t > and natural number I. To complete the proof we establish a 
coupling between X and Z 1 such that for any population size iV and time t>0we have M t > St- 
At all times every cell in X will be paired with one particle in Z . The coupling is as follows: 

• We initially have a one-to-one pairing of each cell i in Xq with each particle i in Zq. 

• The particle in Z 1 that is paired with cell i will increase in type by 1 only when cell i gets 
a mutation. 

• For each cell i in X, at rate (N — 1)/N cells j ^ i are replaced by cell i due to resampling 
events. If cell % replaces cell j due to resampling, then the particle labeled i in Z 1 branches. 
If particle i has a higher type than particle j then the new particle is paired with cell j. 
The particle that was paired with cell j before the branching event is no longer paired with 
any cell in X. If particle i has a lower type than particle j then the particle that was paired 
with cell j remains paired with cell j and the new particle is not paired with any cell in X. 

• The particle paired with cell i in Z 1 branches at rate 1/N and these branching events are 
independent of any of the events in X. When the particle paired with cell i branches due 
to these events the new particle is not paired with any cell in X. 

• Any particles in Z 1 that are not paired with a cell in X branch and acquire mutations 
independently of X. The selection events in X are independent of any events in Z l . 

Let R\ be the type of the particle in Z\ that is paired with cell i. To bound St we consider 
Si = sup < r < s (X ~ - X*) for 1 < i < N. To show M} > S t it is enough to show R\ > Sf for all i. 
Initially Sq < R l Q = for all i. Note that both s ^ SI and s t- > Rl are increasing functions and 
that increases in these functions correspond to decreases in X\. 
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When cell i gets a mutation, R l increases by 1. However, if cell i gets a mutation at time s 
then S % will only increase by 1 if S l s _ = Xq — X l s _ and the mutation is deleterious. Therefore, if 
cell i gets a mutation at time s and Sl_ < R l s _ then 

si < si_ + 1 < + 1 = 

Suppose cell j is replaced by cell i due to a resampling event at time s and that both S J S _ < R J S _ 
and Sl_ < R\_ hold. With probability 1 we have S { s = Sl_ and i?< = R\_. If -X] < 5f_ then 
S2_ = S j s . From this it follows that Si < R j s . If Xjf - X l s > S{_ then S 3 S = Xq - X\ < SI < 
If R\ > R J S _, then by the definition of the coupling, R{ = R\. If R L S < R J S _, then by definition of 
the coupling, R{ = R s _. Therefore, Rl > R\ which gives us Si < R J S . 

Selection events will never increase S l and since S -1 and R l are increasing in time a selection 
event at time s will preserve the inequality S l s < R l s . This shows that any event that occurs at 
time s which may change the fitness of a cell i in J will preserve the inequality S l s < R\. Since 
the result holds for each individual i we have S t < M\. □ 

We now wish to bound the distance the front of the wave moves as a function of the initial 
width. To do so we define a new branching process Z k ^ which has the following dynamics: 

• Initially there are N particles of type k in Zq . 

• Each particle changes from type n to n + 1 at rate fi. 

• A particle of type n branches at rate jn + 1 and upon branching the new particle is also 
type n. 

Let M^^ be the maximum type of any particle in and let M t fc '^ = M t '^—k, so that Mq'^ = 0. 



Lemma 5. For any time t > and any integers k > and I > we have 

P(M t fc,t >l)< 



r/ . ; , N(t f i) l e^ k+l ^ t 



Proof. While all of the particles in Z t have type less than k + I they branch at a rate which is 
less than or equal to j(k + l) + l. Because of this, P(M t fc,t > I) < P(M^ h+l)+1 > I). By Lemma 
[3] we have 

P(M? {k+l)+1 >l)< . 

□ 

Proposition 6. For any fixed time t > and any integer I > we have 

. 2N(tu) l e^ w « +2l ^ + V t 
P( sup D S >1)< VW „ • 

Proof. We first establish a coupling between X and Z Wo+k <' for each integer A; > 0. For k > 1 
define T fc = inf{t : 5t > fc}. Every cell in X will be paired with one particle in Z Wo+k ^ until 
time T k . We couple Z Wo+fc ' t with X for all times t G [0,T fc ) as follows: 
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• We initially have a one-to-one pairing of each cell i in Xq with each particle i in Z Q 

• The particle in z Wo+k '^ that is paired with cell i will increase in type by 1 only when cell 
i gets a mutation. 

• For each cell i in X, at rate (N — 1)/N cells j ^ i are replaced by cell i due to resampling 
events. If cell i replaces cell j due to resampling, then the particle labeled i in Z Wo+k ^ 
branches. If particle i has a higher type than particle j then the new particle is paired 
with cell j. The particle that was paired with cell j before the branching event is no longer 
paired with any cell in X. If particle i has a lower type than particle j then the particle 
that was paired with cell j remains paired with cell j and the new particle is not paired 
with any cell in X. 

• Additionally, the particle paired with cell i in Z Wo+k ^ branches at rate 1/N and these 
branching events are independent of any of the events in X. When the particle paired with 
cell % branches due to these events the new particle is not paired with any cell in X. 

• In X there is a time dependent rate jUl at which cells j ^ i are replaced by cell i due to 
selection events. If cell j is replaced by cell i in X due to a selection event then the particle 
labeled i in z^ 0+k ^ splits. If particle i has a higher type than particle j then the new 
particle is paired with cell j. The particle that was paired with cell j before the branching 
event is no longer paired with any cell in X. If particle i has a lower type than particle j 
then the particle that was paired with cell j remains paired with cell j. The new particle 
is not paired with any cell in X. 

• Additionally, a particle labeled % in Z Wo+k ^ splits at a time dependent rate j(Rl' k — Ul) 
where R l t ,k is the type of particle i. These branching events are independent of any of the 
events in X. When such a branching event occurs, the new particle is not paired with any 
cell in X. 

• Any particles in Z Wo+k ^ that are not paired with a cell in X branch and acquire mutations 
independently of X. 

Fix k > 1. For the above coupling between X and Z Wo+k ^ to be well defined until time T k , 
we need R^ k — Uj > for all i € {1, ... , N} and for all times t G [0, T k ). Let 

T k < 1 = inf{t : Rf - U* < 0}. 

The coupling between X and Z Wo+k ^ is well-defined until time T k = min {T k,i : 1 < i < N}. We 
will show that T k <f k . 

Let S\ = sup < s < t (Xi - X+) and let R^ k = R^ k -W -k. Initially S l < R^ k = for all i. 
Note that both t ^ S\ and t H- R f are increasing functions, from which it follows that 1 1— >■ R t ' 
is also an increasing function. 

When cell i gets a mutation, R l,k increases by 1. However, if cell i gets a mutation at time s 
then S l will only increase by 1 if S l s _ = X\_ — Xq and the mutation is beneficial. Therefore, if 
cell i gets a mutation at time s and Sl_ < R l g k then 

Sl < Sl_ + 1 < R^ k + 1 = Ri k . 
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Suppose cell j is replaced by cell i due to a resampling or selection event at time s and that 
both S 3 < R j s ' k and = S*_ < R^ k = hold. If X* - X+ < then ^_ = 5|. It follows 
that S j a < Ri ,k . If XI - X+ > S 3 S _ then §{ = Xq - X* < < Rl' k . If j&* > flj'*, then by 
the definition of the coupling, R J s ' k = Rl' k . If Rl ,k < Rg' k , then by definition of the coupling, 
Ri' k = R?g k . Therefore, Ri ,k > Ri ,k which gives us Si < R 3 s k . 

For any time s < T k we have Rl' k > SI + W + k ^ X* - X+ + W + k = X\ - X ~ + k. If 
there were N cells with fitness Xq — k at time s G [0, T k)l ) then the rate at which cell i replaces 
these N cells due to selection is 7(X^ — X^ + k). However, for any time s < T k there are fewer 
than N cells being replaced by cell i due to selection and they will all have fitnesses at least as 
large as Xq — k. This gives us a bound on the rate at which resampling events occur on cell i 
before time T k , namely U* < X'' l s - Xq + k < flj* for all s G [0, T k ). This shows that T k < f k/l 
for all i. Hence, T k < T k and the coupling is well-defined until time T k . 

We have shown that any event that occurs at time s G [0,T k ) which may change the fitness 
a cell % in X will preserve 1 
i, for any s G [0,T fc ) we have 



of a cell % in X will preserve the inequality S\ < Since the result holds for each individual 



sup D r = sup SI < sup J2j' fc < M™ 0+k *. 

0<r<s l<i<N l<i<N 

Note that if sup < s < t (X^~ — X~) < k then t < T k . Therefore, on the event 

{ sup (Xq - X-) < k} 

0<s<t 

we have M^ Vo+k ^ > sup < s <t D s . This allows us to do the following computation: 

oo 

P{ sup D s > I) = V P({ sup D s > 1} n { sup (X ~ - X s ~) = »}) 

0<s<t 0<s<t 0<s<t 

oo 

< £ P({Mf > /} n { sup (X ~ - xj) = •}) 

r-r o<s<i 
«=o — — 

oo 

< £p({Mf° + ^ > n { sup (Xq - xt) > i}) 

n 0<s<t 
i=\j — — 

oo 

< P(Mr° +ht > l) A P( sup (XT -X;)> i) 

< f>(Mf° +i ' f > A ( ^Mf ^j by Propositions 

< £ f f| j A (-^p-) by Lemma 



i=0 



< iv(^)^(7(^ + o + i)t - gi7t et ^m 



i=0 i=| 
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< [m i e ht + Ne ty WJL 

V. ^ i\ 

1=1 

N(tfi) l e^ w ° +2l ^ t N(t t i) l e^ t , 

< v (/_!)! + z! b y Lemma Ei 

□ 

We now extend the bound we got on the least fit individuals in Proposition H] to a slightly 
stronger result. Let i£Z and let Sf C {1,2,..., N} correspond to a collection of cells at time t 
which is determined by the following dynamics: 

• Initially Sq consists of all cells whose fitness lies in the interval (x, oo). 

• If a resampling or selection event occurs at time t and a cell not in Sf_ is replaced by a cell 
in Sf_ then it is added to Sf. 

• If a beneficial mutation occurs at time t on a cell not in Sf_ that causes its fitness to increase 



from x to x + 1 it is added to S: 



t ■ 



If a resampling event occurs at time t to a cell in Sf_ and it is replaced by a cell not in S- 



then it is removed from Sf. 



Mutation and selection events do not cause cells to be lost from S x . We now prove the following 
corollary to Proposition [6j 

Corollary 7. Let A l t be the event that a cell in Sf has fitness in (— oo, x — I] for some time 
s G [0, t]. For any time t > and any integer I, 

F ^ A t) ^ • 

Note that we cannot use the bound found in Propositions] because cells not in Sf may move 
to Sf due to selection events. Proposition [J] does not say anything about what happens if the 
population of cells with lower fitness in S x is increasing rapidly due to selection events with cells 
of even lower fitness outside of S x . 

Proof of Corollary [7^ For k > 1 let X be coupled with Z Wo+k '^ as in the proof of Proposition [6j 
Let T k , R l t ' and R]' be defined as they were in the proof of Proposition [6l Define 

Tj = {r G [0, 8 ] : * G Sf} 

and let 

5i _ 
Q — 



sup rg -fi(x - X l r ) if T l s ^ 
-oo S if ft = 



The goal is to show that for all s G [0, T k ) we have 



sup S\< sup Rf < Mf' 0+fe ' t . 

Ki<N Ki<N 



10 



Note that we can only consider the coupling of X with Z Wo+k ^ until time T k because after this 
time the coupling is not well-defined. 

Initially all of the cells in Sq have fitness in (x, oo). Therefore, if i G Sq then Sq < = R^ ■ 
If ii S% then Si = -oo < R^ k . 

Suppose cell i gets a mutation at time s and that for any time s' G [0, s— ) we have S l s , < R 1 ^. 
Then R l,k increases by 1. If i G SJ_ then 5* will only increase by 1 if Sl_ = x — X\ and the 
mutation is deleterious. If i ^ <Sg_ and the mutation does not cause the fitness of cell i to change 
from x to x + 1 then S 1 * = If i £ S*_ and the mutation does cause the fitness of cell i to 
change from x to x + 1 then S 1 * = S l s _ V 0. In any of these three cases, S z s < R l g k . 

Suppose cell j is replaced by cell i due to a resampling or selection event at time s and that 
S 3 S _ < flj'* and < If i $ S x s _ then = S{ < R j ^. Suppose i G If x - X l s < S 3 S _ 

then S J S _ = S J S . From this it follows that Si < R j s . If x - X\ > SL then St = x-Xt< Si < R\. 



If R\ > R s _, then by the definition of the coupling, R{ = R\. If R l s < R s _, then by definition of 
the coupling, R{ = B? s _. Therefore, R{ > R\ which gives us Si < R 3 S . 

Note that if sup 0<s<t (X^~ — X~) < k then t < T k . Therefore, on the event 

{sup (X -X-)<k} 

0<s<t 

wc have Mf ° +fc ' f > 

su Pi<j<Ar &s- This allows us to do the following computation: 



P( sup sup 5* > /) = P({ sup sup S] > 1} n { sup (Xq" - Z7) = i}) 

0<s<tl<i<N ~^ 0<s<tl<i<N 0<s<t 

oo 

< Y,p({mY ^ > 1} n { sup (x - - x-) = i}). 



0<s<t 

i=\j — — 



This is the same bound as equation ([T]) in the proof of Proposition [6l Therefore, we have 
established the same bound. □ 



3 Bounding the rate when the width is small 

We define a process X' which we will couple with X when the width is small. To define X' we first 
define an i.i.d. sequence of continuous-time stochastic processes {2"}^ =0 where the distribution 
of each Z n is the same as the distribution of Z 2logN '^. Let be the maximum type of any 
particle in and let = Mf - 2 log N so that Mq = for all n. Define 

f X+ + M° t if* €[0,1] 
* \ X', f , + -M£_j if t G (i, £ + 1] for any integer i > 1 

and D' t = X[ — Xq . The idea is that D' t is the maximum type of any particle in a branching 
process X' that has the same distribution as Z 2 ^ ogN ^ except that at each integral time unit we 
restart the branching process so that there are once again N particles. At each integral time unit 
t the ./V particles initially have type D' t which is the maximum type achieved by any particle in 
X[ up to time t. 
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Proposition 8. Let a > 1 + 2j. For N large enough we have 

sup -E[D' t ] < 2ak. 

te[i,oo) * 

Proof. By definition D[ has the same distribution as M^ l ° eN '^ so by Lemma Owe have 

P(D[ >l)< £L 



Then 



k 

By Lemma [2] we have 

E[D'i) 



E[D\] 1^ , , „ 1 



Z=0 



ak + 



^1+27^^+1 



k 



<a + 



E 



k l\ 

l=fak] 



k\ak]\ 



Note that for any k > 2 both D' k — D( , and D[ have the same distribution, namely that of 
M 21ogJV,t_ Choose t€[k,k + l) for some k > 1. Because Z?£ is increasing in we have 

^<^(£> , fc+1 - J D / fc + D , fc -... + ^-£>i + £>!)■ 

Therefore, 

^[^]<^^ J Di]<2^[ J Di]. 
Dividing each side by k and using the bound on E[D[] gives us 

1 r h 2N 1+2 ^ u a ~ k e «7fc+^ 7 

H £[D '^ 2a+ irpTi 

By Stirling's formula we have 



k\ak]\ V2^k\ak]^+ 1 / 2 

as long as a > 1 + 2j. 

Proposition 9. Let X l = for 1 < i < N. Then 



as N — > 00, 



□ 



1=1 



/or all times t > 0. 
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Proof. We now couple X with X' by coupling X with the sequence of processes {Z m }™ =0 . Let 

oo oo 

I m = (m, m + 1] n (J [t n , s n ) and J m = (0, 1] n (J [t„ -m,s n - m). 

n=l n=l 

For any m > we couple X and Z m as follows: 

• The particles in Z™ are labeled 1, 2, . . . , N. 

• For any time in the process X behaves independently of i? m . For any time in the 
process Z m behaves independently of the process X. During the time J^, if a particle 
labeled i in Z m branches, the particle remains labeled i and the new particle is unlabeled. 

• The particle in Z m that is paired with cell i will increase in type by 1 at time t G J m only 
when cell i gets a mutation at time t + m £ I m . 

• For each cell i in X, at rate (TV — 1)/N cells j / i are replaced by cell i due to resampling 
events. If cell i replaces cell j due to resampling at time t G I m then the particle labeled 
i in Z m branches at time t — m £ J m . If particle i has a higher type than particle j then 
the new particle is paired with cell j. The particle that was paired with cell j before the 
branching event is no longer paired with any cell in X. If particle i has a lower type than 
particle j then the particle that was paired with cell j remains paired with cell j and the 
new particle is not paired with any cell in X. 

• The particle paired with cell i in Z m branches at rate l/N for all times t G J m and these 
branching events are independent of any of the events in X. When the particle paired with 
cell i branches due to these events the new particle is not paired with any cell in X. 

• In X there is a time dependent rate jUl at which cells j ^ i are replaced by cell i due to 
selection events. If cell j is replaced by cell i in X due to a selection event at time t G I m 
then the particle labeled i in Z m splits at time t — m G J m . If particle i has a higher type 
than particle j then the new particle is paired with cell j. The particle that was paired 
with cell j before the branching event is no longer paired with any cell in X. If particle i 
has a lower type than particle j then the particle that was paired with cell j remains paired 
with cell j. The new particle is not paired with any cell in X. 

• A particle labeled i in Z m splits at a time dependent rate j(Rl' k — Uj) for all times t G J m 
where R l t ,k is the type of particle i. These branching events are independent of any of the 
events in X. When such a branching event occurs, the new particle is not paired with any 
cell in X. 

• Any particles in Z m that are not paired with a cell in X branch and acquire mutations 
independently of X. 

Observe the following bound for D t : 

N t -1 N t 

D t <Yl ( D U+i ~ D si) + " AJ + sup (D s - D SNt ) + sup (D s - D tNf+1 ), 

l— l i—i SM t <S<tM t +l tM t +l<S<t 



13 



where we consider the supremum over the empty set to be 0. By definition we have 



N t -1 N t 
J2(D ti+1 -D Si )+ sup (D s -D SNt )<Y,Yi. 

i=l s Nt <s<t Nt+1 i=1 



To finish the proof we will show 



sup (O, -A,)+ sup (D, - D, ) < D' t . 

-y ti<S<Si t Nt+1 <S<t 



To do this we define 



M t = J2 sup (D s -D ti )+ sup (D s -D tNt+1 ) 

i=1 U<s<Si *jv t + i<«<* 

for all times t > 0. Suppose M s < .D^, for all s G [0, i) and a mutation, resampling or selection 
event occurs in X at time t. If i G (sj, fj+i) for some i > then Mj_ = Mf because the process M 
does not change on these time intervals. It is possible that D' t changes, but D' t can only increase. 
Therefore, D[ > Mt. If t G [ti,Si] fl (m,m + 1] for some i > and m > then at time t the 
processes X and X' are coupled. More precisely, X and Z m are coupled and the coupling has 
the same dynamics as the coupling in Proposition O except the time shift. The same argument 
used in Proposition [6] shows that D' t > Mt whether the cell changed fitness due to mutation, 
resampling or selection. Since this inequality is preserved on any event that may change Mt, it 
is true for all times t. □ 



4 Bounding E[Yi\F Si ] 

For this section we let f(N) = log log log N. 

The first goal of this section is to show that when the width is large enough the selection 
mechanism will cause the width to decrease quickly. We give a labeling to the cells that will help 
us in this regard. Define the following subsets of R: 

h = (-oo,X+- ^W ) 

h = (X+-^W ,X+-^W ] 

h = (X+-^W ,X+-±W ] 

h = (X+ -^W ,oo) 

We will label each cell in Xq with two labels. For the first labeling, we use a to label the cells 
in /i U ^2, we use b to label the cells in I3 and we use c to label the cells in I4. For the second 
labeling we use a' to label the cells in I\, we use b' to label the cells in I2 and we use c' to label 
the cells in I3U I4. 

Let 2lj, Q3j and <£j denote the number of cells labeled 0, b and c at time t, respectively. Let 
2l£, %$' t and denote the number of cells labeled a', b' and c' at time t, respectively. 
The cells change labels over time according to the following dynamics: 
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• Mutations: If the fitness of a cell labeled o increases so that it is in ^3 then the cell is 
relabeled b. If the fitness of a cell labeled a' increases so that it is in I2 then the cell is 
relabeled b'. Likewise, if the fitness of a cell labeled b increases so that it is in I4 then it 
is relabeled c and if the fitness of a cell labeled b' increases so that it is in ^3 then it is 
relabeled c'. Deleterious mutations do not cause cells to be relabeled. 

• Resampling: Any resampling event in which cell i is replaced by cell j causes cell i to inherit 
the labels of cell j. 

• Selection: If a cell labeled a is replaced due to a selection event it inherits the corresponding 
label of the cell that replaced it. If a cell labeled a' is replaced due to a selection event it 
inherits the corresponding label of the cell that replaced it. If a cell labeled b is replaced 
by a cell labeled c due to a selection event then the cell that was labeled b is relabeled c. 
If a cell labeled b' is replaced by a cell labeled c' due to a selection event then the cell that 
was labeled b' is relabeled c'. Any other selection events do not cause the labels of the cells 
to be changed. 

Let A\ be the event that there is a cell labeled b with fitness in (—00, Xq — Wo) for some time 
t G [0, f(N)]. Let A2 be the event that there is a cell labeled c with fitness in (— 00, Xq — ^Wq) 
for some time t G [0, f(N)]. Let A\ be the event that there is a cell labeled b' with fitness in 
(—00, Xq — j|j Wo) for some time t G [0, f(N)]. let A' 2 be the event that there is a cell labeled c' 
with fitness in (— co,Xq — ^Wq) for some time t G [0, f(N)]. 

Lemma 10. Consider any sequence of initial configurations Xq such that Wq > log iV for all N . 
Then 

P(A l UA 2 U A\ U A' 2 ) -)■ as N -> 00. 

Proof. First we show the result for A\. We apply Corollary [7J with x = Xq — 2Wo/16, t = f(N) 
and I = Wo/32. Because x = Xq — 2 Wo/16 we have that Sq consists of all the cells labeled b or 
c. Setting t = f(N) and I = Wo/32 will make A\ the event that a cell labeled b or c has fitness 
less than Xq" — ^W) by time f(N). Note that according to the relabeling dynamics, cell i being 
labeled b or c is equivalent to i G S x . Therefore, A\ C A\ and we get 

2N(tu) l e^ Wo+2l ^ +1 ^ 

We can apply Corollary [7] with x = Xq — 1/16, t = f(N) and I = Wo/32 to get the same 
bound for P(A2). By choosing x, t and I in this way the event A\ is the event that a cell labeled 
c has fitness less than X + (0) — JjWq by time f(N). This shows that P(A2) also tends to as N 
tends to infinity. 

Likewise, to show P{A' l ) tends to as N goes to infinity we can apply Corollary [7J with 
x = Xq — jqWq, t = f(N) and I = W /32, and to show P{A' 2 ) tends to as TV goes to infinity 
we can apply Corollary [7J with x = Xq — ^W , t = f(N) and I = W /32. □ 

Lemma 11. Consider any sequence of initial configurations Xq such that Wq > logiV for all N . 
Let T be a stopping time whose definition may depend on N such that £' T > N/4 for all N . Let 
B T = mi{t > T : Xf > X+ - W /4}. Then 

2' 



P(B T l {T<kf{N)} > -f(N)) -> as N -> 00. 
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Proof. Let A' 3 be the event that €[ > N/5 for all times t G [T,T + |/(JV)). The only way for a 
cell labeled c' to change its label is for it to be replaced by a cell labeled a' or b' via a resampling 
event. The rate at which cells marked c' undergo resampling events with cells marked a' or b' at 
time t is 

g£(iv-g) JV 

JV - 4 ' 

Let {t7 n }^ be a simple random walk with Uq = JV/4 < <£' T . Let T < t\ < ti < . . . be the 
times at which cells labeled c' are involved in resampling events with cells that are not labeled 
c' after time T. We couple {U n }^ =0 with X so that if at time t n a cell is labeled c' due to a 
resampling event then U n = U n -\ + 1. If at time t n a cell loses the label c' due to a resampling 
event then U n = U n -\ — 1. To have U m < N/5 for some m satisfying < m < n we will need 
maxo< m < n \ U m — Uq\ > N/20. It follows from the reflection principle that there exists a constant 
C such that S[maxo< m <n \U m — Uq\] < Cy/n for all n > 0. By Markov's inequality, 



P ( max \U m -U \> N/20) < C^/N 

\0<m<n J 



for some constant C. 

Let R be the number of resampling events that occur in the time interval [T, T + ±f(N)) that 
involve pairs of cells such that one is labeled c' and the other is not. By the bound on the rate 
at which resampling events occur of N/ 4 and Lemma [2] we have 

P(R > k)< V ( jV /( jV )) te " JV/(jV)/8 < (Nf(N)) k 
{ } ~ 2^ 8H\ ~ 8 k k\ 



Then 



i=k+l 



P((A' 3 f) < P({ max \U m - U \ > N/20} n {R < N 3 / 2 }) 
0<m<R 

+ P({ max \U m - U \ > N/20} n {R > N 3/2 }) 

0<m<R 

< P({ max \U m - U \ > N/20}) + P(R > iV 3/2 ) 

0<m<AT 3 / 2 



< JL + ( N f( N )) N3/2 



N i/4 8 jv 3 / 2 pjv 3 / 2 ] ! 
-)■ as JV — > oo. 

Let A4 be the event that % = for some time t G [T, T + |/(JV)). Notice that if S(£ = then 
2lg = for s > t. Therefore, ^4 is the event that the label a' is eliminated by time T + ^f(N). 
By the given dynamics 2l£ can only increase when cells marked a' replace cells marked b' or c' via 
resampling events. At time t the rate at which this happens is 

1 .« 

2 JV — V ; 

We define the event E as 

E = (A[f n (A' 2 f n 4 n {T < i/(JV)}. 
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Selection will cause 21' to decrease. On the event (A' 2 ) c all of the cells marked c' will have 
fitness at least g^Wo greater than any cell marked a until time f(N). Thus, on the event 
(A'2) n {T < 7jf(N)} all of the cells marked c' will have fitness at least greater than any 

cell marked a for all times t G [T,T + ^f(N)). On the event A' s there are at least iV/5 cells 
marked c for all times t G [T, T + ^f(N)). Hence, on the event E cells marked a! will become 
cells marked c' by a rate of at least 

■>*'-< w ° > JLwiaj ( 3 ) 

32iV - 160 * V ; 

for all times t € [T,T + %f(N)). 

Let {C/^} be a biased random walk which goes up with probability 

, _ 160 
P ~ 160 + jW 

and down with probability 1 — p'. Let N be large enough so that p' < 1/2. 

Because the random walk is biased downward, the probability that the random walk visits 
a state j < Uq is 1. Once the random walk is in state j, it goes up 1 with probability p' and 
will eventually return to j with probability 1. The random walk will go down 1 with probability 
1 — p 1 and, from basic martingale arguments, the probability that it never returns to j again is 
(1 — 2p')/(l —p'). Therefore, once U' is in state j, the probability it never returns to state j is 

1—p' 

Hence the number of times U 1 visits a state j < Uq has the geometric distribution with mean 
1/(1 — 2p'). For more details see pages 194-196 of [3]. By equations ([2]) and ([3]) we see that on 
the event E, if 21' changes during the time interval [T, T + ^f(N)) it decreases with probability 
higher than p'. The expected number of times that 21' will visit state j is therefore less than or 
equal to 1/(1 — 2p') for any j G {1, 2 . . . , N — 1}. Also, the rate at which 2l£ changes state is at 
least 

for all times t G [T, T + ^f(N)) by equation ©. Let A = {t > T : % > 0} and let A be Lebesgue 
measure. Then 

N 



as iV — > 00. 

Observe that 



rwTx , 160 ^1 160 log N 



P(E n (^) c ) = p (e n j A(i) > l -f{N) 
= p(\(A)i E >±f(N) 

< jTiV) ^ ^ ^ ar ^ ov ' s Inequality 

-)> as iV 00. 
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Therefore, 



P(E nA' 4 )-p(r< |/(AT)J -> as TV 
This allows us to do the following computation: 



limJP(T< -f(N) )+P(T> -f(N) 



oo. 



lirn^ (P(£ n +P (t > ~f(N)X\ 

^ (i 3 ((4) c n (4>) c n 4 n ^ n |r < i/(i\r)l 



= ^Too P (^V<|/(iv) } <^^))- 

□ 

Let S = inf{t : X~ > X+ - W /4}. 

Proposition 12. Consider any sequence of initial configurations Xq such that Wo > logiV for 
all N. As N tends to infinity, 

P(B > f(N)) -> 0. 

Proof. First note that if QSo + £o > iV/4 then, because all of the cells labeled b or c at time are 
also labeled c', we have that £q > N/4. The result then follows by Lemma [TT] with T = 0. On 
the other hand, if <B + C < W/4 then 2l > 32V/4. 

Let T = (mf{t : 21* < iV/4}) A (inf{f : C t > iV/4}). Let A 5 be the event that QL t > N/4 for all 
times t € [0, ^f(N)). Let A 6 be the event that £ t < iV/4 for all times t £ [0, |/(iV)). Define C to 
be the infimum over all times such that a cell labeled b has fitness in (— oo,Xq — JjWo), a cell 
labeled c has fitness in (-co, X^ - ^W ), or 2l t < N/4. Note that AfnA^f]A 5 C {( > |/(iV)}. 

On the event {£ > ^/(iV)} the rate of increase of Ct due to selection is at least 

^2iV- ^ 128 7W (4) 

for all i G [0, ^f(N)). On the other hand, because £t can only decrease due to resampling, £t 
will decrease no faster than 

1 CtjN-Ct) 

Y — n — ~ u (5) 

Let {U n }^ =0 be a biased random walk with Uq = 1 which goes up with probability 

-yW 



P 



128 + 7W0 



and down with probability 1 — p. Let N be large enough so that p > 1/2. By similar reasoning 
as used in the proof of Lemma [TT| the number of times U n visits a state j ' > 1 has the geometric 
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distribution with mean l/(2p — 1). Also, by basic martingale arguments, the probability that U n 
ever reaches state is 

1-p _ 128 
P 7W0 ' 

Note that £0 > Uq since the cell with the highest fitness is initially labeled c. On the event 
{C > |/(JV)}, we see from equations ((H) and © that if £ changes during time [0, ^f(N)) then it 
increases with a probability of at least p. Therefore, the expected number of times that £ visits 
state j is less than or equal to l/(2p — 1) and the probability the reaches state for some time 
t € [0, \ f{N)) is less than 128/(7Wo). Let Aj be the event that <tt reaches state for some time 
t € [0, |/(AQ). 

By equation (UJ), the rate at which (£ changes is at least 

for all times t 6 [0, |/(JV)) on the event {C > j/(JV)}. Let C = {t e [0, ^/(iV)) : £ < ±N} and 
let A be Lebesgue measure. Then 

E[\(C)l{£>f( N y 2 y] = ^[^(C , )1{C>/(AT)/2} 1 A 7 ] + E [KC)l{(>f(N)/2}lAc] 

1 198 W4J 1 

< ~f(N)P(A 7 ) + "° £ 1 

2 (2p - l)7Wb ^ J 



1281og(iV/4) 
7^0 



By Markov's inequality 



P(Af n^nA 5 n 4$) < p(Af n ^ n ^ n {A(C) > i/(iV)}) 

<P({C>^)}n{A(C)>i/(iV)}) 

= P(A(C)l {c > /(iV )/2}>^/(iV)) 
2P[A((7)l{ f >/ (A r)/2}] 



< 



f(N) 



2561og(iV/4) p AT , 
< — ■ for N large enough 

->• as N -> 00. 

Because P^f n ) ->• 1 we have P^ U ->■ 1 as N ->• 00. 

Note that Ag U A$ C {T < |/(iV)}. Therefore, P(T < |/(JV)) -> 1 as JV -»• 00. Let 
£ 2 = (^'^c n (A' 2 ) c n {T < |/(iV)}. Then P(P 2 ) 1 as Af -> 00. To show P(P < /(JV)) -»■ 1 
we can show P{{B < f(N)} n P 2 ) — ^ 1- At time T, at least jN cells will be labeled either b or 
c. According to the labeling all of these cells are labeled c' so that at time T we have £ T > \N. 
By Lemma [TT] we have 



P (B T l {T<hf{N)} < ~/(JV)) -> 1 as TV 



00. 
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Note that 

^B T l {T<hf(N)} < i/(JV)J = I B r < i/(JV)J U |t > i/(iV)} . 
Because £ 2 C {T < |/(iV)} we have 

|b t i {t<|/(jv)} < n£ 2 = |bt < n £ 2 . 

It then follows that 

P (Ibt < \f( N )\ nE 2 S j -> 1 as iV -> oo. 

However, 

St < ^/W} n£ 2 c |bt < \f(N)} n |t < |/(iV)| C {B < f(N)} 

which gives the conclusion. □ 
Let V? = {i : X\ > X+ + W /4} and V t 2 = {i : X\ < - W /4}. Define 

F = mi{t : V^UV? / 0}. 
We now want to bound the time it takes for the width to increase. 

Proposition 13. Consider any sequence of initial configurations Xq that depend on N such that 
W > logiV for all N. Then 

lim P(F > log log W ) = 1. 

N^oo 

Proof. By Proposition [6] with / = Wo/4 and t = log log Wo we have 

P(inf{s : V} ^ 0} < t) = P ( sup D s > l] 

\0<s<t J 

2N(tfi) l e^ Wo+2l ^ +1 ^ 
~ (1 = 1)1 

— > as N — > oo. 

By Proposition [4] with / = Wo/4 and t = log log Wo we have 

P(inf{s : 0} < t) = P ( sup (X Q - X~) > l) 

\0<s<t J 



l\ 

as N — > oo. 



□ 



Recall that Yi = sup Si<s<t D s — D Si and that {Ft}t>o is the natural filtration associated 
with X. Note that if W < 2~logiV then for all n > 1 the width satisfies W Sn = [21ogiV]. 
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Proposition 14. Let Xq = for 1 < i < N. For N large enough we have E{Yi\J- Si ] < 51ogiV 
for all i > 1. 

Proof. We consider a sequence of initial configurations Xq depending on N such that for all iV 
we have Wq = [2 log N~\. Because Wq > 21ogiV we have si = and Y\ = sup 0<s<42 D s — Dq. 
We will show that for N large enough, J5[Yl] < 51ogiV. The result then follows because X is a 
strong Markov process. 

We make the following definitions: 

V^{s) = {i : X\ > X+ + W,/4} for s > 
V t 2 (s) = {i : X\ < X- - W s /A} for s > 
F = B = r = 

F n = inf ^ 1 (r„_i) U Vf (r n _i) / for n > 1 
*>r„-i 

B n = inf X? > X+ n _ x - W rn _jA for n > 1 

r n = F n A B n for n > 1 

n* = inf{n > 1 : W r „ < logiV}. 

Note that r\ is the first time that the event FUB occurs and that, conceptually, r n acts like the 
first time that F U B occurs when the process is started at time r n _i for n > 2. The random 
variables F n and B n play the roles of the events F and B when the processes are started at time 
r n -l- 

On the event n — 1 < n*, by Proposition [12] and the strong Markov property of X we have 
P(B n < r n _i + f(N)\J r rn _ 1 ) — > 1 uniformly on a set of probability 1 as N — > oo. Likewise, 
on the event n — 1 < n*, by Proposition [13] and the strong Markov property we also have 
P(F n > r n _i + /(-/V)|J> n _ 1 ) —7- 1 uniformly on a set of probability 1 as N — > oo. Therefore, on 
the event n — 1 < n* we have P{B n < F n \J T rn _ 1 ) — > 1 uniformly on a set of probability 1. 

Because the bounds in Propositions [12] and [13] do not depend on n we can choose a sequence 
Pn such that pn — > 1 as N — > oo and almost surely 

PN^{n~Kn,} < P(B n < ^n|-7>„_i)l{n-l<n*} 

for all n > 0. Let {S* n }^L be a random walk starting at 1 which goes down 1 with probability 
Pn and up 1 with probability 1 — pn until it reaches 0. Once S reaches it is fixed. For n < 
we couple S with X as follows: 

• Each step of the process S corresponds to a time r n . 

• On the event {F n < B n } we have S n — S n -i = 1. 

• On the event {B n < F n } we have S n — S n -i = —1 with probability pN/P(B n < F ri ) and 
we have S n — S n -\ = — 1 with probability 1 — p^ jP{B n < Fn)- 

We will show that this coupling is well-defined and gives the necessary bound. Initially, 5o = 1 
and 2 s °~ 1 Wo = W . On the event that B n < F n , we have 

1 1 

W rn < ^Wm-i and SU P A " D r n -i < 7^„-r 
z r„_i<t<r n 4 
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On the event that F n < B n , we have 



W rn < 2W rn _ 1 and sup D t - D rn _ x < \w rn _, + 1. 



Therefore, if 2 5n - 1_1 W > W Tn _ 1 then 2 Sn ~ 1 Wo > W Tn by the coupling. It follows that 
2 Sn ~ 1 W > swp rn _ 1<t<rn D t -D Tn _ 1 as well. By induction, 2 Sn ~ 1 W > W Tn for all n < n*Ainf{m : 
S m = 0}. If n = inf{m : S m = 0} then W Tn < logN. Therefore n* < inf{m : S m = 0} and the 
induction holds for all n < n*. 

We define a function d on ({0} U N)°° such that if x = (xq, xi, . . . ) then 



d(x) = J2h* i >o} 2Xi ~ 1 w . 

Consider S = (So, Si, ... ) as a random element in ({0} U N)°°. Then 

d((S ,Si,...,S n ,0,0,...))>^( sup D t -D n \> sup A 



0<t<r„ 



for all n such that n — 1 < n*. By definition, n* is the first n such that W^ n < log N. Hence 
d(S) > Yi. 

For any n > we have 

P(S2n+i = 0) = ( 2n + (1 - PnTpT < 4 n (l - Piv)>^ +1 . 

If 

S2n+i = then 

/ n \ 



d{S) < + 2 2 <_1 J W = 2 n+1 W 

which is obtained by taking n steps up followed by n + 1 steps down. 
Therefore, 



E[Y{\ < E[d(S)\ < Y,m-PN)] n pT2 n+1 Wo = , ^ nW ° ; ~ 41ogiV 
^ 8(1 Pn)Pn 



because Wq = [2 log N] and pjy — > 1 as — > oo. This shows that for N large enough we have 
E[Yi] < 51ogiV, which gives the conclusion. □ 

5 Bounding the frequency of getting large widths 

For this section let I = [logiVj and t = log log N. We define 

2N(tri) l e ( -^ Wo+2 ^ + ^ +1 ^ t 

Kl = (z=i)i and 

K2 = ^> — 
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Lemma 15. Consider any sequence of initial configurations Xq that depend on N such that 
W < log AT. Then 



P[ sup W s < 21ogiV ) > 1 - K x - K 2 . 

K 0<s<t 



Proof. By Proposition [6] we have 



P\ sup D s > I ) < Kx. 

\0<s<t 



By Corollary H] we have 



P ( sup (X - X-) >l)< K 2 . 

\0<s<t J 



If sup 0<s<4 D s < log N and sup 0<s<t X Q — X s < log N then sup 0<s<4 Wt < 2 log A 7 ". This gives 
the result. □ 

Proposition 16. Let Wq = 0. For N large enough, 

sup -E[N S ] < - 
se[i,oo) s * 



Proof. Notice that 



Therefore, 



W > »} = {«i < s} C <j - tj) < 

3=1 



P(N s >i)<P\J2( s J-tj)< 

Applying Lemma [15] and the strong Markov property of X we have 

P (sj - tj > t\F tj ) >1-Kx-K 2 
for all j. Taking expectations of both sides yields 



1 - K - Ko < E 



ird P( S j-tj > t\T tj ) 



< iniE [P{ Sj -tj > t\F t] )] 

= MP(sj - tj > t) . 

■j 

Let pn = 1 — Kx — K 2 . Note that pm — > 1 as A" — > oo. Define an i.i.d. sequence {Vi}^ of 
random variables with distribution PiVi = 0) = 1 — Pn and PiVi = t) = p^. Then 



,3=1 / \3=l 
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This will allow us to define a new process N' s such that N' s = i if 

i i+l 
3=1 0=1 

Note that P{N' S = 0) = p N for s G [0,t) and that P(iV^ > k) > P(N S > k) for all k. Therefore, 
it is enough to bound E[N' s ]/s. 

Let Vo = 0. Jumps of the process N' s only occur at points kt where /c is a positive integer. On 
the time interval [l,t) the process N' s is constant and has value max{i > : Vi = 0}. Therefore 
Ng has the shifted geometric distribution for s G [0, t) with mean (1— pn)/pn- We can now make 
use of the fact that N' s is a Markov process. If we consider values at kt for k > we have for 
s £ [(k — l)t,kt) that E[N' S ] = fc(l -p N )/PN- For k > 2 we then have 

= ^fc^ < * (1 ~ pjy) . 

a spat (k-l)p N t 



This gives us 



On the time interval [l,t) we have 



-E[N' S ] < 



(k- 


1)pjv 


(1- 


-Pn) 



as iV — > oo. 



as iV — > oo. 



PAT 



□ 



6 Proof of Theorem [T] 

Proof of Theorem^ Fix t > 1 and e > 0. There exists Nq which does not depend on t such that 
for any N > Nq we have 



E 



< E 



E 



by Proposition [9] 



D' t 



+ E 



sr N * v. 



< {l + 2 1 + e)k + -E 



Nt 



< {l + 2 1 + e)k + -E 



by Proposition [8] 



i=l . 
ATt " 

i=l 

(l + 2 7 + e )fc + I jT£[y f i {JVt > i} ] 

i=l 
_ oo 

(1 + 2 7 + e )fc + - ^[^[^l{Ar t >i}I^J] 



i=l 
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(1 + 2 7 + e)k + - J2 EWn^EIY^ 



t 

i=i 

51 2V °° 

< (l + 27 + e)fe+ ° g ^2E[l{N t >i}} by Proposition M 
* i=i 
5 log: 2V 

= (l + 2 7 + e)*+— ^— £[2V t ]. 

For 2V large enough, k <2 log 2V/ log log 2V. Therefore, for 2V large enough, 

51og2V log2V 51og2V 

(1 + 2 7 + e)k + ^p-E[N t ] < 2(1 + 2 7 + + ~^E[N t ] 

, log2V 207log2V , , 

< 2 1 + 2 7 + e ) f Af + - ' 6 AT by Proposition nS 
log log 2V log log 2V 

log 2V 

= 2(l + 12 7 + e) 



log log 2V 

□ 
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